BIOTECHNOLOGY Ml ^ ^ 
SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: O 6 ! ^l ; /o(>A RECEIVED 

Source: / / b0$ 

Date Processed by STIC: / / ^ J?j\ 0j* JAN 2 1 ?003 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. „ - u prMTC D 1 finn/QQOfl 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: Itl/H OWN I &n 10UU/«UU 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://w>vw.uspto.gov/web/ofrices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User ManuaV ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or ^ 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 
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RECEIVED 



JAN 2 1 ?003 




TECH CENTER 1600/2900 



RAW SEQUENCE LISTING DATE: 01 /09/200...1 

PATENT APPLICATION: US/09/421 , 106A TJMK: i 0 : S 

I npu: Sot. : D:\SoyBac.txt 

Output S ( L : N:\CRF4\01092003\l421106A.raw 



<•'] } 0> APPLICANT: Ly.: aia, Joseph R. 

-ILL/- TITLE O L INVENTION: NUCLEIC AC I L MOLECULES AND OTHER MOLECULES ASSOCIATED WITH 
PLANTS 

<■ 1 3 Ci : E I EE EELERENCE: 3 8 - 2 1 ( 1 0 L 98 ) B 

< 1 -1 Pi ■ CURRENT APPLICATION NUMBER: 09AE21,]OOA 

■-i-n vuerenl eiil.no DATE: 1 9 9 9 - I 0 - I 5 

- I-.lL- NUMEEE DE SEQ IV LOS: 3t988 




ERRORED SEQUENCES 



Ooob Mot Comply 
^rtnKk«ttP Needed 



14 

1 o 

IV 

i 

E — > 20 
<> 26 



seo i d nd : ; 

HE NO Li: : i 4 V 
TYiE: LNA 

ORG API SM : C 1 _y c .i jria.x _ 



all n lea 



0>T P EH I : J -"<' 01 1AT I ON : unsu r e 
<400> SEQUENCE: 1 
actcattagc ttatggagaa gctttttctt ttta'actfnftc ttctcctatt agagcttata 

: . : < : a . : g ■ • * 4 a * o. • : . < gg cCdCt a t. a tdttctgcaa tctqqtactq tqcoatatat 
atggatggtg ghjbttggaca tttggat 

. pi Or :l;.) : c> no: < 

VI 1 L LENGTH : 4 2 2 
213: ■ I VPS: DNA 
. : I • • 3>R DANISH: Glycine m 




uns 



0 

d r e 



^ 2 2o7 

at alJ n locations 



E — > 57 

L 

-> 61 



at 



ccataccccc 1 1 1 c c c co; t 



0 



LL- ICViEE INFORMATION 
<4 00> SEQUENCE : 3 

. i q c 1 1 gt g -i a . 1 1 q t: q ack <j ^ .dinac i. g a a a a t a o cc c 

tcacaaactfi Jtgtggaatac tattgetact ccagaacaat gatctttagt taatctacac 120 

o ;t acq j t a ar 1 1 o ::a t oa t c taattacctt gcagacccta aaatcagaga agattgagt*. 180 

tqtiqtaqot cc-Oottjaag cagatgetea gtcagcgtao atgtctcago ttggagtaca 2 40 

aua t < jg sgoa gtag<:agcgg tgatcacaga agatagtgat ctaatagcat attfujetgtec 2G>0 

agctgtaaga actcctccaa tactgtgata ttgcgcatgg aggtttactg qmnctttgat 360 

a* co-go:?:, a t *. a ct t g t tcactattca qcttcataga aagcatgeat trfgggatat 420 
,;at 4 23 




V.Vl I." 





: v i i 


'.' E • 


.2 12 


7 9 • 


. . I 1 .2 


8 1 • 


■2 2 3 



SE«q 1 D 1 10: 4 
LENGTH : 4^2 
TYPE: DNA 
cEGANl SM: G2 yciru 



max 



0:THEE I N E0 RMATI C N : unsure at all n locations 
E--> 82 <400> SEQUENCE: 4 

8 4 * cecal otgt. t. c Lit a g t.r g tacaaaaaca caatccctat. cat: gga*. taa oacatoqaga 00 

80 gcattltcag cxaacccaag qcaccacttc t.tqraacaca rtaotggect. acacgoatog 12:0 

88 aaaoaqt.aca qcaqat gaaa at.qqtqqq 1 c aaa + . taacO. tcaoaalltq q'Oogoacaqa 1 L 0 
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RAW SEQUENCE LISTING 

PATENT API- 1, 1 CAT I ON : US/09/421 , 106A 



■ATE: Hi /au/; < 
T 1 ME : 1 0 : c )5 : : 



Cut put 



: D:\SoyBac.txt 
A : N:\CRF4\01092003\l421106A.raw 



90 aaCjjqrjot a t t t * rtr t ida cqaaccaUt aarqq^aqi'd uaa t qaat. ga gaa t ooagt g 

^ 9 . ? . t t a e * q g t e t. g q t ■ - a t 1 } g t. 1_ c <j c a g age t. g g a g a a ci a gat. t. t. a t : a g .a g c a 1 1. a c a a a t a c 

-> 94 tggtccagca acctatcagt agctntctgt aactagcata gatgggaagc tattgaccaa 

9C atgtaaeaiat. gt a ■_ e t agt c tgqattotaa eagagggaoc tteatceoao aeaglrat.ae 

9 H tectqi alt c 1 t. a - j t. a e c e e 3 g t a e 1 a t t e t. a t. a a t a t a a a t: 

i '. i <;: i j> 5ko i c :k> a 

10P 1 I : LENGTH : : 94 
i ! . : ' "MP. Mi PP: PNA 
1 ORGAN! CM: CLveiee ma> 

1 C- i.i 'CPA O'THEk I N F'ORMAT I aMI : unsure t -rt al 1 n 1 oeciTi ion 



24 0 

3 0 0 
360 

4 P 0 
4 CP 



PAe 4'THEk IN FORMAT ION : unsure t et all 
E--> 107 <400> SEQUENCE: 5 

1 n A agetA t.ceea eft t gaaoaa a t: a aeoot. ea geoaaat.aqa ateeatettq aqeettttt 



1 ] i o-aaaa.a 

lis \ ::a.::cA 

1 I o a oau( t 

i 1 ; •;a.:e-t 
1 'I ,; * jat - ca 



lc; ea ■. ~ aat. qqq a qaga aa t gt tcatrtaaag oataeaa gt. e eetaatat.ta 

et a a a. 1 1. t a gage teotagggag ea aaa<"'aat. g * gt. g t. ."A. :ct agagagagca 

aeoa eattt.gtttt t oe at: 1. 1 1 t.q tat:lt.qa1aa catatqgBan ttget.et.agg 

: • r ■ - aru'tgeatg :<"'. u * aa:Uqct t : geectet aat. gt. a ot.. to- agt. 

: Ud a o a> A a : ga a 1 gaoaaat tec te. gga.au e.i a aat at. >;q:aa eetaecotte 



> 121 ngcgggaggg cgacgcgtga etcgegggat gcgt 



1 



1 P. PPC ! C N'u: 6 

11: PkNCTH: 4 64 

i - ■ PEGANJ PA : G.1 y a.i no in -jx 

o OTHER INFORMATION: unsure at aii ra 



E — > 130 <400> SEQUENCE: 6 



ti 



P4 



394 



132 


n taagagga t gc tn taa tgg 


agganaataa 


agagagaagg 


ngggagcaca 


aaattgaagg 


60 


1 ': 4 




at a a a at ag aa. i a : u:aa: 


qqaa eat: t ga 


agt gt. gr et e 


a t aagaCt t t. 


eat t ea t. eaa 




1 






atgateetat 




ggt ,igau. tee 


1 1 gac a a get 




1 a :• 




; et 1 q ^ • : u a :a U • act; t. q 


agaaaet t at 


t r. gaaaaa ae 


t Aieattgaga 


a ggtagagct. 


P 4 a 






a g at a : a ca c.r u: e.a t. ec 


at. a set a age 


tcaactcAe 


g a q a a g 1. 1 1 1 : 


eat aaga aga 


.aea 


14.; 




*: e et a aaga aq- A aeaq at 


t ag at. aea a a 


taeet eret a 


atagct a age 






144 


gagatgggaa gctagagctn 


tgctacacac 


cenctatgat 


agctaagctc 


acccccatga 


420 


146 


caaaatacat ganaatacaa 


aaaagatccc 


tactacaaag 


acta 




464 


1 4 




P 1 4".: ■ PEQ ID NO: 7 












i" "j 




Pile LENGTH : / - 












i l : i 




A4: TYPE: PC IA. 












1 •:• e 




Pi A- okGAN IP: 14: OA y :: i r 


)C 










1 1 

_L -3 




4:2 other infmgaiatip 


44 : ansura a 


r a 1 1 n 1 o cat i o n s 






155 


<400> SEQUENCE: 7 












157 


agctntgaaa agtgttgttn 


ttcaccttct 


cgctaagcca 


atccgctggc 


ttagcgagcg 


60 


i : . 


t 


a: i.; 1 e t a a q a: c o a a a a c t e a 


1. 1: ggee a age 


g e ei a g g a a g a 


at et ggaaga 


aaa tgagetg 


IPi- 


1 p 1 


1 


aeaag '. t eg et tagcaeac 


-agtetegtet 


eaotaaqege 


aeegct t. eag 


t eea t eaget 


1 8 i : 


i »: o 




agegaoa a a cq< .ageget. 


aagccgaaat 


z c actaatgt 


gegct aageg 


gt ecagaa 1 1 


2 4 4. 


i > 1 




a:get a act g ca -gageaeg 


aacaaggcea 


e e t a 1. 1 1 a a g 


v 1 1 1 g a a a t e a 


g a 1 1 1 1 g t g a 


304 






ggqagt t t.g gg< :t. agg.it. t 


e a g a g e 1 1. 1 g 


eat. gt. ctaqa 


gat t et. a gag 


agagaaac^gt 


3 b 0 


: * 




■ t a at t : c a g a g < l 










37 3 


i - y > 




elPe PEC IP NO: 9 
4 1 10 .LENGTH: 4P1 













1j: TYRE: ONA 
1 3> PRC AN. I CM: G 



one m a x 
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RAW SEQUENCE LISTING 

}■ ATKNT APPi A CATION : US/09/421 , 106A 



DATK : 01 /(AV20(A J 



\ 1 1 p a ' 2 e V : D : \ SoyBac . tx t 

Output Sot. : N: \CRF4\01092003\I421106A. raw 



E — > 



4 



E — > 




2 0 1 


< 2 2 2 - 


CT HEP. I NKOP.MATIC 


N : u n a u r e a t 


a J 1 n ] cu 


vat. ; oca 








202 


<400> 


SEQUENCE : 9 














/ is] 


aqet tt. 


a a c t l V t : q T. < " a V c a 


a q a q a a t a t_. a a t - 




( .a uquat. a a 


qt AC 


3a t a a 


6 0 


2 1 1 1 > 




at. aa t c- ate 1 .*.:: 


a ca t a a t. c t t at 


i tcaacac 


cct t. ca at ca 


at. eta 


tcaat 


1 iV 


.A i r 


at ct t <: 


t t.ta atctLt A:ca 


a cat. t i t caa c<- 


iqat.ett. t. e 


1 C J U I . i. . a 1 . i I 


cac;A.T. 


-at at 


A P (" 


2 I : 


V t. ct. a a 


cagt t tt t cr\ vcaa 


t. aqt t.. t. ct ct t 
ctct tctccc * t 


aaa aqaaaa 


q t t. c t. t_ q t t 


caaaa 


act t.c 
:qcct 


z. -4 L 




aqet. at t < :,di c; .u 


t a c caa a a 


qaa qq^^qga 


eta ac 




214 


gaattttttt gtgtctctct 


tctcccttac aaaagattca 


naggactaac 


cgectgatat 




2 . 1 ; 


at ctq.t 


t ct t. t. ee.a-atuca 


a a qa t : t a a a q> 


j a e I a a c t. q 


cct. qaqaa 1 1. 


Ct t. tq 


v. c c c a 


/i ■ a" 




.a 














l i i J 


) ■ ~ 


< 21 0> 

< 2 1 1 

-a'A2- 


2EA I D NO: : V' 
: ENG1H : I C. ■] 

' A -A: A . A : 'AA/cir 


A^ a7 


oC 










a ■ t , 




(AAPEP INPPO.IatATK 


N: unsure at. 


a .: u it ] ct 


:at i ens 








227 


<400> 


SEQUENCE: 10 
















vat. ct a 


c act cc< • c 4 uaca 


gt acct t cat u( 


t aegqqqca 


qqaccat caa 


ca qcc 


aqccq 




■ ; - *' ■ 


' :a. : * a 


• • • a< a : : - - a •■ a 


ct aca t. caqc >e 


aaat :vt t 


aeaqt. teat c 


t <aiqc 


V Ca^a 


1 ' " 
J . 




V qcaca 


cga: tgauctccat 


rA.qcatqc.ct .A 




t qt qg t tgar 


Cci a t a 


gq cqq 


1 ' ■ ' 




' - ' a • ; • c 


at eg qcqataqqrg 


<t a q ct. qa a : c a < 


i<jO ct. 1 1 1 a 


ccaqt ac a cc 


e t. acq 


v cage 


/ i ■. 




agaof t 


a qua t c< tt. 


t act cat ggc 1 1 


C cccqa 


qt. agt V t qqq 




iqt t. q 








c t ;g a cut aaqc a: 


a a 1. 1 1. 1 caaq 1 < 


iqqq aca aq 


accct z aaaq 


q CCCC 


aqqa q 




241 


ttgaagatgg agctcaagaa 


gacgacgaca tangegatgt 


gatg 






404 






: ?;A l 0 ]],; : ■. ;-; 


















L ENGTH : - •« 
















A'-:l ;a ■ 


1YPE: DNA 
















' 2 1 A 


ePGANIcM: • . . yc .1 r 


e max / ^ 2-C ? 












. ■ , o - . 


202-EP 1 1 j FOAMAT 1 2 


■N : unsure at. 


a 1. 1 t: I oca t .i oris 








290 


<400> 


SEQUENCE: 13 
















aqct * q 




at.tcatt.tcc c 


/■-[it , ■ 1- r - + 


t q (.: 1 1 t a a a a 




:caa t 








ate: g ttt:t.,qv gc 


c a 1 1. a 1 1 c t. a at 


caa c a t a q 


t q qa t qa t a t 


tgctc 


t a a a c 


a;. : 



296 aagattgetc ttgcctttga ctttctctat ctcctctcgn gatttttttt atttgagcaa 180 

"2 '-»- ccqt'qatva tatc >:M: aqgq qtqqaacttc qtatatqtct t.taatatct.t ccaataqatc 2-;0 
W--> 300 acaagcatca agatagggtt ccgttctaat agectagagg tggtaatgtt ntccattgaa 300 



qt eaa a 



:tav -a, 



a 21. ij 
c211 
■22}.- 

<212 
C222 



E — > 




v^c.a 'tggacacct:t aqtcccata 2 4 

PP'J ] D N2: It 
I.EPJGTH : 

TYPE: AAA , ^ ^ 

ijMANISI-I: 'Aytire max — ) ^1 <~ / 

a cTPEF. INPOAMAT ION: erasure at ail. n locations 
333 <400> SEQUENCE: 15 

335 agettgeata actntgaatg gngtattggt agagtttatc cgttaatgat atgggctatt 60 
337 gagttgggga ggattgattn tggaacttgt cgtggtgcag aagttagttc aagtgcgaac 120 

1- i uctaetaga.a aa-naqctt: ttgcqatqca cttacgacat. cqqtccaaca aaactqtccga 1 
;--A aqtatattaa a:<i AqcA: tqtqtaatPa caacqaaaqt. qtqcattttq ccaattttat 2 



qqtt(iat'avt. qq 



a a a 



:A.vq qqqv.att 



"j v t a ( \ q q t a g t. a < i a 1 1. 
AA 0> SKQ ID NO: Id 



ccttgaaggt tqtt. qqaaoq qact cgaqag t qaqqaaact .3 
qa 1. 1 teegta acatacttaa tqctcteaca acataqtqqa 3 o J 
catt 38 4 
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RAW SEQUENCE LISTING 

PAT ENT A P PI , I C AT 1 0 N : US / 0 9 / 4 2 1 , 1 0 6A 



PATP: 0; / 0 P I 2 C ! 
TIME: 10:PPMP 



r i p u ! P e 1 : D : \ SoyBac . txt 

ajapat. Sc. : N:\CRF4\01092003\l421106A.raw 




406 
W — > 408 

■PI M 



•i 1 



> 431 

W--> 433 



:PT 
ii.il" 



: 3 . .. 1 J .i'i 



,1 c :. t ,-.ar. 1. 



at. qat 



• Villi- PEuPTH : 44/ 
■AMP: T V PP: PNA 

< ; 2 3 : ■: hgan ISM: G 1 yc : ne x </ «2£?V* 

-PPM MMIER LIIFOKMAT:c;N: u.'iSJie at. all a locations 
<400> SEQUENCE: 18 

ctnggtgttg ttcctattgt gcgagttact gaggtgcaat ttcaatttta attggataat 

}.i : ;<:-;• a * \ a q c a a t a t . a t ^ t. a- aaq t a t. qa cactqcatca cacact t tat tatt.tgeoac 
a.r;.t. 1 a: taqq ^aat :va-aaa att.t i gt ggg t tctat.taet. tatttaat.ga ac t P ca t eg 
t gat t t : aqa attt :t. : i:a aa 1. 1... 1 1 -usee aataat.aat. a at. aa t. aqaqt, q t qt. t a :;t t a 
jaaqc: 1 ] :;P t ]tat a;'".ao ca.:t ^ct^tt. gaagt at. age a t caaa cat. gaaaggaatt 
ccattttaag tattatcctg taccanaacc tcactttagt ccccaatttt ggaaatcaca 
gttcttttca ctgacaaatg acttacagtt ntagttaaaa atagggatta acaagagtgg 
ag • ; • a a : acoaziq.; : act 
< P 1 ■">;■ PPi.) 1 p no: : * 

. . ! ] :p:]gtp: • w 
<PMo: I yi p : piia 
■Mm tPPANIPM: G.ycMne max, ) <2 

-M :- ; .PMa oTPEF. J IIPG1P P.TPJN: unsuie- at a 1 .1 a locations 
E--> 419 <400> SEQUENCE: 19 

- :. * t .a a : : t a c dC ai: . . a a a c a t t. a t. a a t o c • c 1. 1 c a a t. a g t. a g g a 1 q a g a g t a t g c c t c 

* : ? t t c.a -.ii ■.;*,; i: '3 1 a 1 1 g t. g . v ' t a , uotagaaac aagaat. aaga gaaaattaaa 

a qat.t tqt t 1 1 :t ttttiP. tt. t. tt. t gtaat. z 

: at taaqqqqc ctttcagaag aact tg 1 . qaa 
c ttcaaqtgaa aagattttta tactatgaac 
taaccaaaaa tcatcctatg attnttaata taattattat aaaattacca tacatcataa 
tttgagaatg tagaanacat aaacaacgtt tacact 
" t: SJPP 1 0 IIP: pa 
: LENGTH: 4 : M« 
P TAPE: PIIA 

; ORGANISM: C/cne max )Z 2 2^/ 

•;■ O'MIEF. INEOF.MATION : unsjre at a J .1 n locations 
E — > 442 <400> SEQUENCE: 20 

gtcctcgggc cattcctgcg aaggaaaaca tttggatagt tagttntacc aagaaatget 

aoo ::t t auaa caaaaa* ^qc atacaa:ct.c ctccaataaa tacaaacatc aacgtaaata 
P- t a- ; i:cc-= ::qc 1 1 a 1 ao; ::at at 1 1. tct. ; ac caaoattcac tegcacaaqa tactccccta 
a i aotaaqcaaa at(p.5 0>::at qcacaatcaa ggeacttteg ttacctacat tacttgtatg 
P' t. a -'.v. * c aaag * .cc t:aca:c3Cdt goatttcctt gqetaaattt. a:atacatgc 

PI at go* caaaq cct ■:■ : r g jet accaaaaqtt qcacacatgc aaactttatg atgaar :ttg 
:o ^pi P ctr.:a raataag :jtg ctacacttca tgctttatat :aact g 1 1. tt actaccagaa 
> goo-:ica:acg aat ^toivjta tat 1 1 z at. 1 1 tgecgacta 
P <:.: ) ■■ ■ SEQ 1 P 1 10: P 1 
P <PMP1P PEMGTH: :>2z 
2.: ■ .P ^ TYPE: DIIA 

M • . : • uP.CAMSM: G.ycMae max ^ )Z / <^2^7/ 7 

.Pt.- GTHEFi I NFOF.1 1ATTON : unsure at, all n locations 
E--> 467 <400> SEQUENCE: 21 

■It) a accttctccc ccaa 1 1 1.. t ct. ataaataggg qqagaaqtqt aqtaqaaaaq c^qt t eaq taaa 
■I'M i'tPagqcal t c t. c a: -at. t. teqaatttge M.aqquaaat tgt.ttccgtg aaqaaaat.ee 
4 Pa aagcegaqge gettcegtaa egtt.tccqt.q aqtqattttq cga agg 1. 1; 1 1. eqacoqt.r.ct. 



^^>^444 



60 

1 2 0 



360 
420 



360 
396 



60 



4P" 



1 b L 
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RAW SEQUENCE LISTING 

i'ATKNT AAALlCAlAoN : US/09/421 , 1 06A 



I -ATI-: : 



ii.r u: OA a : D:\SoyBac.txt 

On; pa: : N:\CRF4\01092003\l421106A.raw 



*--> 475 
/Vff-r> 477 
<*U4-^> 479 



48.! 
18 \ 



E — > 488 
7 > 490 

■1 *a 



E~> 531 
V^-^C533 



tcgacgntct tcattcgttc ttcatcgntc ttcagtcttc aacgggtaag tacctcatac 

caagcttttc aattcattct atatacccgn nnggggccac attatggttc atgtattatt 

attctcgntt catttactct ttataccc 

Oily ID NO: 12 
C21 1 ■ LEi JGT H : ': 91 

<Al2 ■ T Yr'E : DIJA / ? ^ 

<.:13- tAAANIOM: A vrine :ta>:^. ) <C_ <UD / 

aa2 3 =:» I riEK 1 N A3A11AT I ON : unsure at all n Jecations 
<400> SEQUENCE: 22 

ggctctanat ntacattgat gtttgtattt atgggaggag gttatatgee atttttgett 

'.-j a^. : q: ; : qt ' : eta : qq taaaaetaac t ! t ccaaatq tttgeettcq eaqqaatuea 

<" : j t 7 ct aa a agaggt ecag gaaggacaag gcggcegaag gaactagttc 

egc :■ c :i :i ■> g t a < : a :■ j ql e accqrt t tag gagcgttgta ca:caqcag: gct.tcgaagc 

■:a t ?. :: a : a t g q -A c t c uga q a a eg acgcgtccag ctcatqqacg a cgaq tai t. 

* • ; i * * * ■ - : : :« : j ■. .-. C a : qgagecqacg qtaqqaaeca ttgqttaetc era t tag aac a a 
A - t - ;a :. ■: •■ * a ga< ta : a y cc ttgagittta • 

a i o : r.) a : i 

■A 1 i :a : NAAH : 4 
■A 1 : I V r Z : ANA 

■A 1 :i ■ • A-;a:A AIA Aycir.e n;ax^ ^ ^/ J> Z^T? _ 

2 - tAAYA. 1 1 1 At > AM AT 1 All : uii.sure at all a locatiana 
<400> SEQUENCE: 24 

ggcacactct ntgattatct tggtctacca agtgtttatt acacaatagt gaaatgeact 

* : r * . -caa i r ::t t at get c a a tea et qaa t ctjaattdtc t. ccagccac:: ea a a taa aa 
ca^.Aag.jg gtc:aaaaaag gaaasaaact /iacactqcca ataac^ccaa qiteeaggtt 
*: * a o s a t * a t :tat t at ea a a a ■ -a uattctacaa cct attatac aaacataact 



a.^;a;a a a a ne : a 
- ; c haaaa - * a ca 1 t a 

. la. qh v : D NO: 
."ll I.ENGl H : A7 
LA- TYPE: ANA 
•: A A AN I oil : 



tat a at a a a a a ■ -g gatlctacaa cct attata 
tag taaaaaaaaa a aaa aaa a at ttacqtcac 
aaa attqtaqqtt tqtaccctgc agt :tgcac 
att. aaat taatta at cgattaat: actaccatc 



g t q t t... t c c a t a 
a agt agtacca 



a_) A A A^ 

aaure a 



E--> 590 
W--> 592 




— > 602 



<•■■{_ 

C - ' l 

A A 
6)4 

E— > 635 



:. yc j r.e ma 

MATICN: unsarfB* at. all n JoCdUiona 
<400> SEQUENCE : 27 

agcttctccc ccaattntct ataaataggt ggagaagtga agtgaanaag ggttcagccc 

A :a.-gc.v:: a c\ :t r.ctt tcgaatA:Agc t tqgaaaaai:. tq*-:t aegtg a ag a aaa tec 
a 7 :gc<" gag ; ;: get: atjaaa cqtttccqea acgtttccgt gaggaatttc gcgaaggttt 
■:■ iaca -gt t -ai: tcaja -tata; ct: tcattcgttc ttcatcgttc ttcgatcttc aacgggtaag 
r.rac aaa.:.c caauct^ttc gattcattct ategtacctgt ggtggtccac attgtggttc: 
gtggattttt attctcgntt catttacttt ctataccccc ttttgacgtg gettaageca 
a t tat 1 1 d,q : .. . 1 1 1 1 etc 
1 'A ■ Alcq 1 D NAa A 1 * 
1 A TAAJATH: 40: 
: . • TY ?}•]: ANA 

L3> ORGAN ISri: Glycine :t c-:_ ) ^ 2 2_j3? 

2a> OTHEA 1 NFC F.MAT ] CN : unsure at. all n i oca t ions 
<400> SEQUENCE: 2 9 



240 
300 
328 



60 

1 8 0 
A 4 A 



A a ] 



60 



60 

1 2v. 
1 8 A 
24 0 
200 
360 
37 9 
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RAW SEQUENCE LISTING 

PATENT API l.J CAT I ON : US/09/421 , 106A 



PATH: 0 J ,/0 ( VP0(P 
12 MP: 10: P 3 : 3 7 



D : \SoyBac . txt 

N: \CRF4\01092003\I421106A. raw 




37 

E--> 658 
Wt--> 660 



A 



agcttanagg agcactcana tcgggtgtat ttaaccccat 
cgtcagggcc tctccctcct gattcaggtc caacccanaa 
ntatctatga actgtacaaa atacacgact cctcaattgt 

dt.cqcgct.tu t. cut. t aa^et cgtcagqt cc caacagtggt 
cgea t. taact cctcqccct t. agattcat.au t tcacaadl c 
g c a c a l a t a t a t * a ca a etc ^at acata^t c a a t 1. 1: a t. c a 
aagt.ggta'a ate: ::aat t t. oacatgttat cacacct rat 
< 2 1 -j : GEO IP NO: 3 3 
< 2 'l 12 [ PNC -TP : 4 53 
<. 2 1 P: . TYPE: PI1A 

<P I 0 ■ ORGANISM: G.I yci rae max. -) <L 2 Z{3 S 

<32 2- OTHER J NPOPMAT J ON : unsure} at all n lot: 
<400> SEQUENCE: 30 

ntaactnttc aatctctctg canataaata acaagattac 



2 



Xa » 



a t * ■: 



g,t at 
:g:t.at. 

' t 

: a a ccc 



a g i 
au< 



a .a a <;.: a g 
at ac.eg 
; c t u a a 



W--> 



> 7 c 
672 




at. ; a. ; a j ; ia t g< :a :gaaa'"' 
tgcatacaca taaaccanat 

a act a " gac^t aagaaat a ac; 

<312- LENGTH: :-98 
aPIPe IYPP: DNA 
<P 1 2 ■ ORGAN! SM: '2 y< 2 : 
■02/ 3: ■ OTHER IN OTP MAI ; ( 
<400> SEQUENCE: 31 
;g"a tag:., a ca ■: :: ajt t. ct t 
t. a t. 1 c a 1. t a g a c a t age u t. c 
t g c a 1 g a a a g a g : a ■:: ■;: 1. 1. g t 
•■ a gap gat ct t t tt art.agcaa 
actggtgagt tggaegcata 
tctacttt a t. c t z a g t. a g a 
tcattaag t. c 1 t a c aj t. g a a t 



ia:tt.ga,P a aaa teat gag 
u t t. t. ^a a g eta a a a :: a a a t. q e 
e 1. 1. a a a t a I c t aca t a a a e a 
t g c a a g t a a t . a t. g c t. g t . 1 1 e 
; • c a c: a g a alt a a g t c a t, a a a 
cctaccattn taattntaca 



ggectagact 
aacattntag 
tctcaaaata 

t eeeat eat a 
agggeacar.i 
a a. a a at t t g 
gaaa a at. a 



:a t teas 

ttatatatca 

gtcgaat + 
aaggca g eg: 
gaat. at t eoa 
a t at 1. 1 at ea 
■.ac t g a a c t Ca 
ccctccccan 



ccgaagagtc 
cacacagact 
attttatcta 

a t. act. eqee a 
aeat et euat 
gt. ct caat aaa 



tattgagatg 

t tgaaat act 
aag aaa t. aaa 
t aa agat t at 
aaa I at agga 
a e ■_ a c a t < m g 
acccacaatg 



N : ur.su re at all ra Joe 

e: gaaaetag tccaact.t.tt 
a t c t a t... g t a c t \ t g g c 1 1. g a 
e a g g a e 1 1. 1. g t ■ z< a t a a g g a t. 
t c a t c c a g 1. 1. g g 1 1 e t. ca t. g a 
ttggtcttga ctggacctag 
a g a c t. c a t. e t agct:ca a c a 
g g c c t c 1 1 c c a c a g t e a t 



cctaaat tge 
act eg a aaa g 
■a a auaa a t. qa t 
ctcatagagg 
caacatattt 
t.tgtagtet t 



t.. 1. 1. a a c e e a g 
e a a a g 1 1 c t a 
et gggaet ct 
tt gat. eat c c 
cacgatattn 
aaeet tat. t. a 



60 
120 
180 

7 o 

."UIM 

P r> [J 
A0-- 



60 



420 





X:«/'7,^^2.2: 1 ^- 
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<210> 
<211> 
<212> 
<213> 

<400> 



C 




gtgtctttcg gatgcttctt ct 



<210> 36938 
<211> 22 

<212> QNA — ____ 

<213> c Artificial Sequence 



<400> 



36938 



caccattttg cacctaagtt ga 



RAW SEQUENCE LISTING ERROR SUMMARY 7'A'i'K : o;/(;4/::00 

IATKKT ArirL; CAT ] ON : US/09/421 , 106A NIK : :u::/7 : 2fc 



Aa..; :A,t : D:\SoyBac.txt 

Oiii.rut Set : N: \CRF4\01092003\I421106A. raw 
Use of <220> Feature (NEW RULES) : 

Sequence (s) are missing the <220> Feature and associated headings. 
Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence 
or "Unknown" . Please explain source of genetic material in <220> to <223> 
section (See "Federal Register," 6/01/98, Vol. 63, No. 1 04 , pp . 2 9631 -32 ) 
(Sec. 1.823 of new Rules) 

Seqif : 36937, 369'<9 
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